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WHAT IS CLAIMED IS: 

1 1. A method of analyzing a data source, said method 

2 comprising: 

3 comparing the data source to a reference file; 

4 determining whether the data source is balanced in 

5 response to the comparing; and 

6 adjusting the data source based on the determining, 

7 wherein the adjusting results in a more balanced 

8 data source. 

1 2. The method as described in claim 1 further comprising: 

2 matching one or more records from the data source to 

3 one or more reference file records; 

4 generating a comparison master file based on the 

5 matching; and 

6 assigning an index number to each record in the 

7 comparison master file. 

1 3. The method as described in claim 1 further comprising: 

2 retrieving a rule corresponding to an element in the 

3 data source; 

4 determining whether the element in the data source 

5 approximates a corresponding value in the 

6 reference file based on the retrieved rule; and 

7 assigning a match to the element in response to the 

8 determination. 

1 4. The method as described in claim 1 further comprising: 

2 matching one or more records from the data source to 

3 one or more reference file records; and 

4 calculating a first bias value based upon the 

5 matching. 
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The method as described in claim 4 further comprising: 
matching one or more records from a second data source 

to one or more reference file records; 
calculating a second bias value based upon the 

matching; and 
comparing the first bias value to the second bias 

value . 

The method as described in claim 1 further comprising: 
identifying a first data source sample size; 
comparing a first data source sample corresponding to 

the first data source sample size to the 

reference file; 
determining a match percentage based on the comparing; 

and 

calculating a second data source sample size by 

dividing the first data source sample size by the 
match percentage. 

The method as described in claim 6 further comprising: 
identifying a second data source corresponding to the 

second data source sample size; 
matching one or more records from the second data 

source to one or more reference file records; and 
calculating a second match percentage based on the 

matching. 

An information handling system comprising: 
one or more processors; 
a memory accessible by the processors; 
one or more nonvolatile storage devices accessible by 
the processors; 
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6 a data source handling tool to manage a data source 

7 stored on one of the nonvolatile storage devices, 

8 the data source handling tool including: 

9 means for comparing the data source to a 

10 reference file stored on one of the 

11 nonvolatile storage devices; 

12 means for determining whether the data source is 

13 balanced in response to the comparing; and 

14 means for adjusting the data source based on the 

15 determining, wherein the adjusting results 

16 in a more balanced data source. 

1 9. The information handling system as described in claim 

2 8 further comprising: 

3 means for matching one or more records from the data 

4 source to one or more reference file records; 

5 means for generating a comparison master file based on 

6 the matching; and 

7 means for assigning an index number to each record in 

8 the comparison master file. 

1 10. The information handling system as described in claim 

2 8 further comprising: 

3 means for retrieving a rule corresponding to an 

4 element in the data source from one of the 

5 nonvolatile storage devices; 

6 means for determining whether the element in the data 

7 source approximates a corresponding value in the 

8 reference file based on the retrieved rule; and 

9 means for assigning a match to the element in response 
10 to the determination. 



Docket No. AUS9-2001-0646-US1 



Atty. Ref. No. IBM-1036 



1 11. The information handling system as described in claim 

2 8 further comprising: 

3 means for matching one or more records from the data 

4 source to one or more reference file records; and 

5 means for calculating a first bias value based upon 

6 the matching. 

1 12. The information handling system as described in claim 

2 8 further comprising: 

p 3 means for matching one or more records from a second 
% a 4 data source to one or more reference file 

*j£ 5 records; 

pi 6 means for calculating a second bias value based upon 
Tz 7 the matching; and 

s 8 means for comparing the first bias value to the second 
55 9 bias value. 

D 1 13. The information handling system as described in claim 

M; 2 8 further comprising: 

3 means for identifying a first data source sample size; 

4 means for comparing a first data source sample 

5 corresponding to the first data source sample 

6 size to the reference file; 

7 means for determining a match percentage based on the 

8 comparing; and 

9 means for calculating a second data source sample size 
1° by dividing the first data source sample size by 
11 the match percentage. 

1 14. The information handling system as described in claim 

2 13 further comprising: 
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means for identifying a second data source 

corresponding to the second data source sample 
size; 

means for matching one or more records from the second 
data source to one or more reference file 
records ; and 

means for calculating a second match percentage based 
on the matching. 

15. A computer program product stored in a computer 
operable media for managing a data source, said 
computer program product comprising: 

means for comparing the data source to a reference 
file; 

means for determining whether the data source is 
balanced in response to the comparing; and 

means for adjusting the data source based on the 

determining, wherein the adjusting results in a 
more balanced data source. 

16. The computer program product described in claim 15 
further comprising : 

means for matching one or more records from the data 
source to one or more reference file records; 

means for generating a comparison master file based on 
the matching; and 

means for assigning an index number to each record in 
the comparison master file. 

17. The computer program product described in claim 15 
further comprising: 
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3 means for retrieving a rule corresponding to an 

4 element in the data source from the nonvolatile 

5 storage area; 

6 means for determining whether the element in the data 

7 source approximates a corresponding value in the 

8 reference file based on the retrieved rule; and 

9 means for assigning a match to the element in response 
10 to the determination, 

1 18. The computer program product described in claim 15 

2 further comprising: 

3 means for matching one or more records from the data 

4 source to one or more reference file records; and 

5 means for calculating a first bias value based upon 

6 the matching. 

1 19. The computer program product described in claim 15 

2 further comprising: 

3 means for matching one or more records from a second 

4 data source to one or more reference file 

5 records; 

6 means for calculating a second bias value based upon 

7 the matching; and 

8 means for comparing the first bias value to the second 

9 bias value. 

1 20. The computer program product described in claim 15 

2 further comprising: 

3 means for identifying a first data source sample size; 

4 means for comparing a first data source sample 

5 corresponding to the first data source sample 

6 size to the reference file; 
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means for determining a match percentage based on the 

comparing; and 
means for calculating a second data source sample size 

by dividing the first data source sample size by 

the match percentage. 



